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ABSTRACT 


Introduction: Schizophrenia (SZ) is a neurological disorder, the causative agents of which may be multiple factors like genetic, 
environmental factors, or co-morbidities with other diseases. The actual reason for the occurrence of this disorder is yet to be 
unrevealed. The genes responsible for this disorder are vulnerable to mutations at the chromosomal or protein levels. So identi- 
fication of disease-associated mutations may pave the way for divulging the root cause behind the disorder. In the current study, 
the emphasis had been made on finding the said disease-associated mutations for the disorder through in-silico analyses. 
Methods: The genes and FDA approved antipsychotics were prioritized using text mining approach, which shortlists nine genes 
(COMT, DISC1, DAOA, NRG1, PRODH, RGS4, GRM3, DRD3 and DTNBP7) and seven antipsychotics (Haloperidol, Fluphena- 
zine, Aripiprazole, Clozapine, lloperidone, Lurasidone, and Risperidone). The genes were checked for deleterious or damaging 
mutations using SIFT and PolyPhen servers. 

Results: The SNPs rs6267 and rs4986871in COMT protein were found deleterious with both the servers. SNPs rs2391191 
and rs9558562 were found damaging in DAOA protein. In case of DISC1 protein five SNPs (rs6675281, rs821616, rs3738400, 
1834622148, and rs55795950) were found damaging. NRG1 and RGS4 protein have one deleterious mutation (rs3924999 and 
rs68678746 respectively) and three deleterious mutations (rs450046, rs2870984 and rs397055) were present in PRODH pro- 
tein. The SNPs rs181422088 (in DRD3) and rs16876589 (in DTNBP1) were found deleterious with both the servers. The native 
protein and their mutated form were modelled and docked with the antipsychotics to check their binding energies. 

Conclusion: The results showed that the binding energies between antipsychotics and mutated proteins were lower as com- 
pared to native protein suggesting that mutated proteins bind well and were stable, so a person is prescribed antipsychotics 
to reduce the symptoms of the disorder. Thus, these mutations may be the reason behind the pathophysiology of the disorder. 


Key Words: Antipsychotics, Docking, Genes, /n-silico, Mutations, Schizophrenia. 


live in their world of hallucinations and delusions ?. The dis- 
order is not as common as other psychiatric disorders, and 
the symptoms are very disabling. According to WHO status 
2018, schizophrenia has affected 20 million people in the 


INTRODUCTION 


In the current study, in-silico methods and servers were used 
to predict the mutations in the gene candidate which may 


have a role in the cause of schizophrenia (SZ). /n-silico 
methods/approaches have set foot in modern pharmacology, 
drug identification, and discovery. /n-silico methods have 
several advantages including fast predictions, safety and 
time, and cost-effectiveness ! 


SZ is a stern and chronic disorder, affecting the thinking and 
social behaviour of an individual. People affected with this 
disorder seem like they have lost touch with reality i.e., they 


world, men in their early age from 13 to 25 years, and wom- 
en of age 25-35 years being most susceptible *. The disorder 
may affect the occupational and educational behaviour of an 
individual. People with schizophrenia are more vulnerable 
to die 2-3 times early as compared to the general population 
because of the combination of many diseases like metabolic, 
infectious, and cardiovascular diseases ?. 


SZ is a complex disorder with unknown etiology, with ap- 
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proximately 800 genes (identified through GWAS) which 
may have a role in the susceptibility of SZ *. In many studies, 
it has been found that mutations in the genes are responsible 
for the pathophysiology of the disorder?. The mutations may 
arise during DNA replication or due to environmental fac- 
tors. For about half of the human inherited diseases, amino 
acid substitutions are responsible °. The SNPs (single nu- 
cleotide polymorphism) identification is significant for pre- 
dicting an individual’s risk of developing diseases, response 
to certain drugs and susceptibility to environmental factors 
such as toxins 7. The SNPs are categorized into non-synony- 
mous (nsSNPs) and synonymous mutations. 


The people diagnosed with SZ are treated with antipsychot- 
ics as prescribed by neurologists. There are more than 20 
FDA approved antipsychotics for neurological disorders. 
The drugs are broadly categorized into first-generation or 
typical drugs and second-generation or atypical drugs **. 
The genes and antipsychotics studied are briefly described in 
tables 1 and 2 respectively. 


MATERIALS AND METHODS 


Gene retrieval and prioritization 

The information regarding genes responsible for the cause 
of schizophrenia was obtained using text mining. The text 
mining approaches include information retrieval, clustering, 
document classification, identification of data trends which 
can be used for ranking of genes. Keywords used for ex- 
tracting data from the scientific literature are very crucial 
for gene prioritization. For finding the genes responsible for 
causing schizophrenia PubTator was used, with the search 
terms "schizophrenia genes", "schizophrenia genes SNPs" 
and "schizophrenia genes mutations SNPs". After short- 
listing with PubTator nine prioritized genes COMT, DISCI, 
DAOA, NRGI, PRODH, RGS4, GRM3, DTNBPI and DRD3 
were chosen for further analyses. 


The nsSNPs of two proteins were retrieved from the dbSNP 
database. The SNPs obtained from dbSNP were then ana- 
lyzed using PolyPhen and SIFT servers "°. 


Analysis of nsSNPs by SIFT 

Sort Intolerant From Tolerant (SIFT) is a tool that distin- 
guishes between tolerant and intolerant amino acid substitu- 
tions. The tool is based on the theory that protein function 
is related to protein phylogeny. The functionally important 
residues remain conserved in the sequence while insignifi- 
cant residues vary in the alignment. When protein sequence 
is submitted to SIFT, it sorts the intolerant amino acid from 
tolerant by generating a score output file in which intoler- 
ant residues are highlighted in red. If a query contains single 
amino acid substitutions the tool can predict mutants having 
phenotypic effects |. 


Analysis of ns SNPs by PolyPhen 

Polymorphism Phenotyping (PolyPhen) works on a Bayes- 
ian approach. It predicts the effect of variants by using both 
structure and sequence information. For the identification 
of functional annotation it creates clustered and multiple se- 
quence alignment. The tool also predicts identity and profile- 
based scores with structural properties like B factor, solvent 
accessibility, and hydrophobic propensity, etc. All the prop- 
erties are combined using two Bayesian models, HumVar 
and HumDiv. These two probabilistic models were trained 
on different datasets. HumVar was trained based on differ- 
ences among disease-causing mutations in the UniProtKB 
database and nsSNPs with no disease phenotype. Hum Var 
model is generally ideal for Mendelian diseases because 
it distinguishes the diseased mutation from normal human 
variants. HumDiv distinguishes damaging alleles (having 
known effect on function) from non-damaging alleles. Hum- 
Div model identifies variants with slightly deleterious alleles 
and treats them as damaging. PolyPhen determines predic- 
tion and threshold by calculating true positive and false-pos- 
itive rates and thus predicts the “probably damaging", “pos- 
sibly damaging” and benign residues ". 


Modelling of nine proteins and their mutant 
sequences 

Native and mutant protein structures were analyzed to study 
their stability. The native structure of PRODH, DTNBPI, 
DISCI were not present in the PDB database and other pro- 
teins like COMT, NRGI, GRM3, DAOA, RGS4 and DRD3 
were present in the database but in ligand bounded form. 
Ab-initio protein modelling was performed to determine the 
three-dimensional structure of the native protein (PRODH, 
DTNBPI and DISCI) using the I-Tasser server. Mutated 
residues were predicted after analyzing with SIFT and Poly- 
Phen. Mutated structures were modelled and energy minimi- 
zation was done by KoBaMIN server ?. RMSD (Root mean 
square deviation) values were studied for the deviation of 
mutant structure from the native structure "1, 


Three-dimensional structures of other proteins (COMT, 
NRGI, GRM3, DAOA, RGS4 and DRD3) were present in 
the database but in the bounded form with ligands, so to ana- 
lyze the structure ofthe only protein, the ligand was removed 
from the structure using Pymol ?. 


Screening of antipsychotic drugs 

The drugs were screened as per prescription by neurolo- 
gists for lowering down the symptoms of schizophrenia. 
The screened drugs were treated as ligands. The FDA ap- 
proved drugs were screened using a text mining approach 
which prioritizes the drugs according to their effect on the 
disorder. PubTator was used for the text mining approach 
with keywords “schizophrenia”, “antipsychotic drugs" and 
“FDA approved”. After applying these filters the seven FDA 
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approved drugs (Haloperidol, Fluphenazine, Aripiprazole, 
Clozapine, Iloperidone, Lurasidone, and Risperidone) were 
selected for in-silico docking purpose. The structures were 
downloaded from an open-source PubChem database. The 
canonical SMILES of these compounds were downloaded 
from PubChem and converted into .pdb format using Open 
Babel which is format inter-conversion software !. 


Docking Studies 

AutodockVina in PyRx was used for the molecular docking 
studies of nine proteins and their mutated forms as receptors. 
The ligands were Haloperidol, Fluphenazine, Aripiprazole, 
Clozapine, Iloperidone, Lurasidone, and Risperidone. The 
.pdb format of receptor and ligand is uploaded as an input 
file, which gets converted into PDBQT format. 


RESULTS AND DISCUSSION 


A total of nine genes were shortlisted because of their role 
in the pathophysiology of schizophrenia and the number of 
cited literature of these genes was very high as compared to 
other genes (Figure 1). The number of studies on COMT and 
schizophrenia is shown as 655 (curated on 15-02-2020) on 
PubTator by using keywords *COMT, Schizophrenia". 


There are 510 studies showing association of NRGI gene 
and schizophrenia in PubTator using keywords “NRGI, 
Schizophrenia". In the case of GRM3, there are 78 published 
literature and the keywords used were *GRM3, Schizophre- 
nia". DAOA and PRODH have 145 and 61 literature respec- 
tively and keywords used were “DAOA, Schizophrenia" and 
"Proline dehydrogenase gene, schizophrenia" respectively. 
In case of DTNBPI, DISCI, RGS4 and DRD3 the result 
comes to be of 314, 700, 113, and 310 respectively. 


SNP dataset of human proteins from dbSNP 

SNPs in the studied genes viz. COMT, NRGI, RGS4, 
DISCI, DTNBPI, GRM3, DRD3, DAOA and PRODH were 
retrieved from dbSNP database. In the case of COMT, a to- 
tal of 8235 SNPs were present which include 298 coding 
non-synonymous SNPs and 336 non-coding SNPs. 541 non- 
coding SNPs were located in 3’ UTR region and 320 SNPs 
in 5' UTR region. The rest of the SNPs were distributed in 
the intron region (7791), stop gained region (19), synony- 
mous coding region (177) etc. For NRGI a total of 260749 
SNPs were present that includes 892 coding non-synony- 
mous SNPs and 2195 non-coding SNPs. Among non-coding 
SNPs, 3053 were present in 3' UTR region and 1198 were 
located in 5” UTR region. In RGS4 2663 SNPs were present 
having 203 coding non-synonymous SNPs and non-coding 
SNPs were not available for this gene. Total 549 non-cod- 
ing SNPs were located in 3' UTR region and 37 SNPs in 5' 
UTR region. For DISCI a total of 93028 SNPs were present 


included 786 coding non-synonymous SNPs and 4384 non- 
coding SNPs. Among non-coding SNPs, 2587 were present 
in 3’ UTR region and 37 were located in 5’ UTR region. In 
the case of DTNBPI, a total of 31557 SNPs were present 
which include 342 coding non-synonymous SNPs and 2010 
non-coding SNPs. 480 non-coding SNPs were located in 
3' UTR region and 364 SNPs in 5' UTR region. In GRM3, 
50136 SNPs were present having 777 coding non-synon- 
ymous SNPs and 2395 non-coding SNPs. 509 non-coding 
SNPs were located in 3’ UTR region and 301 SNPs in 5’ 
UTR region. In DRD3, 16366 SNPs were present having 250 
coding non-synonymous SNPs and 440 non-coding SNPs. 
Overall 205 non-coding SNPs were located in 3' UTR region 
and 235 SNPs in 5’ UTR region. In the case of DAOA, a 
total of 7208 SNPs were present which include 249 coding 
non-synonymous SNPs and 192 non-coding SNPs. 119 non- 
coding SNPs were located in 3’ UTR region and 205 SNPs 
in 5’ UTR region. In PRODH, 5329 SNPs were present hav- 
ing 450 coding non-synonymous SNPs and 540 non-coding 
SNPs. 478 non-coding SNPs were located in 3’ UTR region 
and 62 SNPs in 5’ UTR region. In the present study, nsSNPs 
(coding non-synonymous SNPs) and non-coding SNPs in 3' 
and 5’ UTR regions were studied. 


Damaging nsSNPs predicted by SIFT program 
SIFT was used to study nsSNPs, the nsSNPs predicted from 
genes were submitted to SIFT for the analysis of damaging 
mutations or tolerance indices. Lesser the functional impact, 
higher is the tolerance index of amino acid and vice versa. 
The results of SIFT for nine genes are shown in table 3. 


Total 536 nsSNPs for COMT were uploaded to SIFT for the 
tolerance index analysis, 17 SNPs were found to be dam- 
aging or deleterious having tolerance index of < 0.005. 202 
nsSNPs of DAOA were submitted to SIFT out of which 23 
were found deleterious each having tolerance index of 0.00. 
In the case of DISC1, 1091 nsSNPs were submitted to SIFT, 
28 of which were found to be damaging/deleterious with 
tolerance index < 0.005. Only 20 nsSNPs were deleterious 
out of 471 nsSNPs submitted to SIFT in case of DRD3. The 
365 nsSNPs of DINBP1 were checked for tolerance index, 
showing 20 deleterious nsSNPs. For GRM3 434 nsSNPs 
were submitted to SIFT for the analysis of tolerance index, 
resulting in 38 deleterious nsSNPs. Total 1152 nsSNPs for 
NRGI were predicted for deleterious nsSNPs, which results 
in 61 deleterious nsSNPs. PRODH has 426 nsSNPs out of 
which 31 nsSNPs were deleterious having tolerance index 
< 0.005. Finally, 286 nsSNPs were searched for deleterious 
SNPs showing 4 deleterious nsSNPs. 


PolyPhen server for damaging SNPs 

Polyphon server allows structural level changes in the pro- 
tein. The nsSNPs of genes were also submitted to the Poly- 
Phen server. PolyPhen searches the damaging SNPs and 
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uses GRCh37/hg19 as a reference genome. The results by 
PolyPhen are shown in table 4. PolyPhen uses PSIC (posi- 
tion-specific independent count score) difference of 1.1, nsS- 
NPs above this range is considered deleterious. For protein 
COMT, two SNPs rs6267 and rs4986871 were found pos- 
sibly damaging and probably damaging respectively. In the 
case of DAOA, no SNPs were found damaging and only two 
SNPs were listed in the PolyPhen as rest of the SNPs were not 
found in the UniProtKB. In DISC1, five SNPs (rs6675281, 
rs821616, rs3738400, rs34622148, and rs55795950) were 
found probably and possibly damaging. DRD3 has only one 
damaging SNP (rs181422088) according to PolyPhen. For 
DTNBPI only one SNP (rs16876589) was found damag- 
ing. In the case of GRM3 and NRGI, no SNP and one SNP 
(rs3924999) were found damaging respectively. For PRODH 
seven SNPs (rs450046, rs2238731, rs2870984, rs2904551, 
rs2904552, rs3970559, and rs3970555) were found damag- 
ing and in RGS4 there were no damaging SNPs. 


Modelling of nine proteins and their damaging 
mutant structures 

The nine genes (COMT, DISCI, DRD3, RGS4, GRMS, 
PRODH, NRGI, DTNBPI and DAOA) translate to their 
respective proteins. Human COMT protein structure was 
available in protein data bank (4PYT), thus this 3D structure 
was used as a native structure for COMT mutant structure 
modelling. The SNPs which were shown deleterious or dam- 
aging by both PolyPhen and SIFT server were predicted as 
functionally important mutations. The two protein mutations 
occurred at two SNPs rs6267 and rs4986871. The mutations 
were at position 146 (AV) and 72 (AS). These protein 
mutant structures were modelled using COMT protein as the 
reference model. COMT protein and its two mutants (A146V 
and A72S) were uploaded in SwissPDB viewer for the energy 
minimization. Table 5 shows the total energy after minimi- 
zation of COMT and its two mutant structures (A146V and 
A72S) which were found to be -9350.167 kJ/mol, -9302.922 
kJ/mol and -9364.405 kJ/mol respectively. The RMSD value 
of COMT with A146V mutant and A72S mutant was found 
to be 2.682 A^and2.689 A? respectively. If the RMSD differ- 
ence of two protein is higher, the deviation in their structure 
is also higher thus greater is the change in their functional 
activity. From table 5, it is clear that the RMSD values of two 
COMT proteins are higher as compared to native protein, so 
these SNPs can affect the protein's structure and function. 
Among two mutants total energy and RMSD values of A728 
is greater than A146V mutant. Therefore, A72S mutation is 
predicted to be more deleterious and affect the functional ac- 
tivity of COMT protein. 


The SNPs of DAOA (rs2391191 and rs9558562) were found 
damaging and deleterious according to Polyphen and SIFT, 
which may be responsible for altering the function of the 
protein. The SNP rs2391191 at position 30 where amino acid 


R (Arginine) substitutes into K (Lysine) and SNP rs9558562 
at position 62 where amino acid K (Lysine) substitutes into 
E (Glutamic Acid). Human DAOA was available in protein 
data bank (3 W4K) but in the bounded form so the protein 
was modelled using 3W4K model as a reference structure 
with the help of I- Tasser. The server-generated 2 models and 
model 2™ was found to be the best model after checking the 
quality by SAVES 5.0 Server. The predicted model was also 
checked for Ramachandran Plot which depicts the 92.996 
amino acid residues in the most favoured region, 2% in the 
allowed region and none amino acid was found in the disal- 
lowed region. The model had passed all the parameters and 
hence was used as a native structure for modelling of mutant 
structures of DAOA protein. The energy of modelled struc- 
tures was minimized which was found to be -1739.696 kJ/ 
mol for native DAOA, -1800.172 kJ/mol for mutant R30K 
and -1562.715 kJ/mol for K62E mutant. Also, the RMSD val- 
ue for R30K with DAOA was 2.43 A? and K62E with DAOA 
was 1.83 A?. Thus the mutation (R30K) can be predicted to 
affect the function and structure of the protein as its RMSD 
value is higher than the native protein (Table 5). In DISCI, 
the five SNPs (156675281, rs821616, rs3738400, rs34622148 
and rs55795950) were found deleterious and damaging us- 
ing Polyphen and SIFT. The SNPs rs6675281, rs821616, 
rs3738400, rs34622148 and rs55795950 were found at posi- 
tion L607F, S704C, GSV, L330F and T328N respectively. 
The 3D model of DISCI was available in protein data bank 
(5Y14) but in the bounded form so the structure was modelled 
using 5Y14 as a template. I-Tasser server was used to gener- 
ate the model and was also checked using the Ramachandran 
Plot for the stability of the modelled structure. 93.2% resi- 
dues were found in the favoured region, 3.4% in the allowed 
region and 3.4% in the outer region thus this model was fur- 
ther used for the analysis. The mutations of DISCI protein 
was also modelled using DISCI as native structure and their 
energy minimization and RMSD value were calculated. The 
energy minimization and RMSD value of native DISCI were 
found to be -3368.829 kJ/mol and 2.30 A? respectively. From 
the table 5, it is clear that mutants L607F, S704C, T328N 
have higher RMSD value (2.6 A^, 2.49 A? and 2.52 A?) from 
the native DISCI protein and hence they can affect the pro- 
tein structure and function. The SNPs of DRD3 were also 
checked using SIFT and Polyphen and there was only one 
dbSNP (rs181422088) which was found deleterious and 
damaging by both the servers. The SNP rs181422088 alters 
the amino acid valine at position 157 to isoleucine. The 3D 
model of DRD3 protein was already available in protein data 
bank (3PBL) so the model was used as a native structure for 
the mutant modelling. The energy minimization and RMSD 
value of the mutant structure (V1571) were found higher than 
the native structure as shown in table 5. The RMSD value 
and energy minimization of native DRD3 protein were found 
to be 3.65 A? and -18798.566 kJ/mol respectively whereas 
its mutant form (V1571) have higher energy -19337.301 kJ/ 
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mol and RMSD value 3.89 A“. Thus the mutant V157I can 
affect the structure and function of DRD3 protein. DTNBPI 
protein has only one SNP (rs16876589) which is found del- 
eterious and damaging by SIFT and Polyphen both. Also, 
the 3D structure of DTNBPI was not available in a protein 
data bank so the structure was modelled using I-Tasser. The 
structure quality and stability of protein were checked us- 
ing the Ramachandran Plot. The Ramachandran Plot depicts 
that the amino acid fall in the favourable region was 94.796, 
residues fall in the allowed region was 3.2% and 2.1% in the 
outer region (Figure 3). The RMSD value and energy after 
minimization was also predicted which was found to be 2.54 
A? and -4446.120 kJ/mol respectively for native DNTBPI. 
These values were found much higher in case of its mutant 
structure which is 2.78 A? and -4570.659 kJ/mol (Table 5). 
In the case of GRM3 protein, no common SNPs were found 
to deleterious using both Polyphen and SIFT. Thus, analy- 
sis of GRM3 was not performed further. NRGI protein had 
one deleterious SNP (153924999) common in both Polyphen 
and SIFT. The rs3924999 substitutes amino acid arginine 
to glutamine at position 38. The 3D structure of the NRGI 
protein was not available in a protein data bank so the pro- 
tein was modelled using I-Tasser. The structure was further 
checked for the stability by Ramachandran Plot which plots 
the amino acid residues, 93.3% of residues were present in 
the favourable region, 5.7% in the allowed region and 1% 
in the outer region. The energy minimization and RMSD 
value of native NRGI protein were predicted as -4865.479 
kJ/mol and 2.3 A? respectively. The R38Q mutant of NRGI 
protein as energy minimization and RMSD value -5471.062 
kJ/mol and 2.9 A? which is more than native protein and 
hence can be predicted to affect the structure and function 
of the protein. SIFT and Polyphen analysis of PRODH pro- 
tein results in three common deleterious or damaging SNPs 
(rs450046, rs2870984 and rs3970555). The rs450046 sub- 
stitute amino acid glutamine with arginine at position 521, 
1s2870984 replaces the rionine with methionine at posi- 
tion 446 and rs3970559 substitute amino acid arginine with 
cysteine at position 453. The 3D structure of PRODH protein 
was not available in a protein data bank so it was modelled. 
The structure was further validated for its stability by Ra- 
machandran Plot. The plot predicts that 93.5% of amino acid 
residues are present in the favourable region and 6.5% are 
present in the allowed region. Thus this model was used as a 
native model for the mutant proteins of PRODH. The RMSD 
and energy minimization of native and mutant proteins was 
predicted which results in high RMSD values and higher 
energy minimization (Table 5) than native protein structure. 
Hence these SNPs can affect the structure and function ofthe 
protein. For RGS4 protein, only one deleterious and damag- 
ing SNP was found by Polyphen and SIFT. The SNP was 
rs68678746 which substitute arginine to tryptophan at po- 
sition 134. The 3D model of RGS4 was already available 
in protein data bank (1EZT) and the model was used as a 


native structure against modelling of mutant structure. After 
the prediction of energy minimization and RMSD values, it 
was found that SNP rs68678746 can alter the structure and 
function of protein RGS4. The modelled structure of all the 
proteins and their respective mutants are shown in figure 2. 


Assessment of stabilizing residues among na- 
tive and mutant structures 

S Ride server was used for the identification of stabilizing 
residues between native and mutant protein structures. The 
server predicts the stabilizing residue for all nine genes and 
their respective mutant structures. There was 06 stabiliz- 
ing residue in native COMT structure as well as in the mu- 
tant structure of COMT as highlighted in Table 6. Higher 
the number of common stabilizing residue the mutation is 
predicted to be deleterious. Thus both mutations (A146V 
and A72S) can be an important candidate for schizophre- 
nia caused by COMT protein. In the case of DAOA protein, 
there was 16 stabilizing residue in native DAOA protein, 
14 stabilizing residues in R30K DAOA mutant protein and 
13 stabilizing residues in K62E mutant DAOA protein. 14 
and 13 stabilizing residues were found common in R30K 
and K62E mutant DAOA protein respectively. As a higher 
number of stabilizing residue is common in R30K mutant 
protein, thus the mutation R30K is predicted to be more del- 
eterious as compared with K62E mutation. Hence the muta- 
tion from arginine to lysine at position 30 of DAOA protein 
is predicted to be more damaging and could be an important 
candidate for schizophrenia caused by DAOA protein. For 
DISCI, 04 stabilizing residue was present in the native struc- 
ture and no stabilizing residues were found in any of the five 
mutant structures (L607F, S704C, G5V, L330F and T328N). 
Thus all the five mutant structure could be an important 
candidate for schizophrenia caused by DISCI protein. Only 
one stabilizing residue was found in native DRD3 protein 
structure and no stabilizing residue was present in its mutant 
structure (V157I). Hence the mutation can be deleterious 
and responsible for causing schizophrenia. Two stabilizing 
residues were found for native DTNBPI and 05 stabilizing 
residues were found in G214D mutant structure of DTNBPI, 
out of which two residues were common. Thus the mutation 
G214D was predicted to deleterious and can be an important 
candidate for schizophrenia caused by DTNBPI protein. In 
NRGI protein 06 stabilizing residues were found and no sta- 
bilizing residue was found in its mutant structure. 13 stabiliz- 
ing residues were present in native PRODH protein and 12, 
07 and 09 stabilizing residues were found in Q521R, T466M 
and R453C mutants respectively. Nine, six and nine stabiliz- 
ing residues were common in Q521R, T466M and R453C 
mutant structure respectively. As a higher number of stabi- 
lizing residue is common in Q521R and R453C mutant pro- 
tein, thus the mutations Q521R and R453C are predicted to 
be more deleterious as compared with the T466M mutation. 
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Hence the mutations are predicted to be more damaging and 
could be an important candidate for schizophrenia caused by 
PRODH protein. In native RGS4 and mutant RGS4 protein, 
there is only one stabilizing residue. Thus R134W mutant 
protein can be predicted to be deleterious and hence can be 
an important candidate for schizophrenia caused by RGS4 
protein (Table 6). 


Screening of antipsychotic drugs 

Although there are many FDA approved antipsychotics 
available but they are only responsible for lowering down 
the symptoms. The text mining of literature through PubTa- 
tor suggests that there are seven antipsychotics in priority 
which are mostly given to patients with schizophrenia. 


Docking 

The antipsychotics compounds are treated as ligands that 
were docked against receptors (proteins and their mutant 
structures). Molecular docking calculates the strength of 
association and predicts the orientation of molecules when 
ligands and receptors are bound with each other. The pdb 
structure of ligands (antipsychotics drugs) and receptor (pro- 
teins and their mutated form) were uploaded as input file 
in PyRx. The studied proteins DTNBPI, COMT, NRGI, 
PRODH, RGS4, DRD3, DAOA, DISCI and their damaging 
mutant forms were docked against drugs aripiprazole, clo- 
zapine, fluphenazine, haloperidol, iloperidone, risperidone 
and lurasidone (Table 7). The native proteins were used as 
a positive control for drugs against their respective mutant 
forms. The drug clozapine has a binding affinity of -5.7 kcal/ 
mol with DTNBPI protein whereas, in case of its mutant 
form (G214D), glycine at position 214 gets substituted by 
aspartic acid showed a less binding affinity with clozapine 
(-6.8 kcal/mol) as compared to the native protein. Thus if 
protein DTNBP1 gets mutated the antipsychotic drugs binds 
well with the protein by blocking its active site. So when 
a diseased person is given antipsychotic drugs it works by 
lowering the symptoms of the disorder. Likewise, the other 
drugs in case of mutated protein bind well with each other 
in comparison with native protein, illustrating that if protein 
(DTNBP1) mutates (G214D) there is a change in binding en- 
ergy and the activity of the protein can get blocked, making 
drug effective. The docked structure of DTNBPI and DTN- 
BPI G214 with Lurasidone is shown in figure 4. The pro- 
tein DRD3 also has one damaging polymorphism i.e, V1571, 
which means amino acid valine at position 157 gets mutated 
into isoleucine. The drug which had the least binding activ- 
ity with DRD3 and its mutant form was risperidone having 
total energy of -9.4 kcal/mol and -10.4 kcal/mol respectively. 
The mutant DRD3 protein binds well with the risperidone, 
blocking the active site of protein thus drugs works well on 
the protein showing that if protein gets mutated, the drug 
can bind well with protein. There are two damaging mutants 


for COMT protein which may affect the pathophysiology of 
schizophrenia. Risperidone showed the least binding energy 
with COMT protein (-9.4 kcal/mol) and its mutated form 
A72S (-10.9 kcal/mol) and A146V (-9.4 kcal/mol). The poly- 
morphism A72S has less binding energy than native protein 
and polymorphism A146V has similar binding energy as of 
COMT protein. Thus mutant A72S may be responsible for 
the cause of schizophrenia as the drugs bind well with the 
mutant form of COMT. The second mutant A146V does not 
show much energy deviation illustrating that this mutation 
may not affect the pathophysiology of the disorder. DAOA 
is also one of the susceptible genes for schizophrenia which 
had two deleterious mutations R30K and K62E. In case of 
native protein DAOA, drug risperidone showed the least 
binding activity of -8.9 kcal/mol, but its mutant form R30K 
had less binding energy -9.7 kcal/mol as compared to the na- 
tive protein. K62E mutant had greater binding energy with 
clozapine (-8.7 kcal/mol) in comparison with native DAOA 
protein, but the drug clozapine showed less binding energy 
(-7.4 kcal/mol) as compared with native protein (-6.2 kcal/ 
mol). Thus both the mutants R30K and K62E may have a 
role in the cause of schizophrenia as antipsychotic drugs 
shows better binding with the mutants as compared to the 
native protein. DISCI had five damaging mutants (L607F, 
S704C, GSV, L330F, and T328N) which may have a role in 
schizophrenia. The drug lurasidone showed better binding 
energies with mutant form except for T328N as compared 
with native protein shown in table 7. In the case of T328N 
drug, haloperidol showed better results in comparison with 
DISCI protein. Thus these mutants may also be responsible 
for schizophrenia. In the case of NRGI protein, lurasidone 
shows least binding energy of -6.7 kcal/mol. The mutation 
R38Q had a damaging effect on the protein also showed least 
binding energy of -7.0 kcal/mol, which is less than the native 
protein, thus the polymorphism R38Q may have role in caus- 
ing the schizophrenia. 


PRODH is also one among the genes responsible for schizo- 
phrenia which had shown least binding energy with risperi- 
done (-7.5 kcal/mol). There were three deleterious mutations 
in PRODH (Q521R, T466M and R453C) which showed 
better binding energies of -8.1 kcal/mol, -9.3 kcal/mol and 
-10.1kcal/mol respectively against risperidone than PRODH 
protein. Thus these mutations may have a role in the cause 
of schizophrenia. In the case of RGS4 only one mutation 
(R134W) was found to be deleterious, but when antipsychot- 
ic drugs were docked against the mutant structure they don’t 
display good results as compared to the native protein. So 
it can be inferred that the mutant structure R134W may not 
have many roles in the pathophysiology of the disorder. The 
docked images of nine proteins and their mutant forms with 
their respective ligands are shown in Figure 4. 
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CONCLUSIONS 


Schizophrenia is a serious disorder that is gradually posing 
a threat to human life. Despite enormous research, the cause 
behind the disorder is unknown. According to the GWAS, 
there are many genes associated with the disorder. In the cur- 
rent study approaches and servers were used to identify the 
putative cause behind the disorder. Nine genes were prior- 
itized using the text mining approach. The encoded proteins 
were checked for disease-associated mutations using SIFT 
and PolyPhen servers. The proteins and their damaging mu- 
tants were modelled and docked with antipsychotics to find 
the binding energy between them. The drugs were shown to 
have high binding energy with mutants as compared to na- 
tive proteins. The interaction energy is considered as best or 
optimum if the complex is thermodynamically stable with 
the release of maximum energy thus stabilizing the interac- 
tion. Thus from the study, it was concluded that there is a 
deleterious mutation in the studied nine proteins may be the 
cause behind the disorder. 
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Table 1: Table showing the list of genes with their accession number 


S.No. Genes Accession Number 

1. DTNBPi (Dystrobrevin Binding NC 000006.12 
Protein 1) 

2. DRD3 (Dopamine receptor D3) NC_000003.12 


COMT (Catechol-O-Methyltrans- 
ferase) 


NRGi (Neuregulin 1) 


NC. 000022. 


NC. 000008. 


Description 


Plays a pivotal role in regulating the glutamatergic system. 
It is located on chromosome 6p22.3. 


D3 receptor is mediated by G proteins which inhibit ade- 
nylyl cyclase. It is situated at 3q13.31 chromosome. 


Mammalian enzyme known to be involved in metabolic 
degradation of catecholamines 


Signalling molecule which has an important role in the 
organ system growth 
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Table 1: (Continued) 


S.No. Genes Accession Number Description 
GRM3 (Glutamate Metabotropic NC 000007.14 Neurotransmitter in the mammalian CNS (Central Nerv- 
Receptor 3) ous system), involved in normal brain functions. 
DAOA (D-amino acid oxidase NC 000013.11 Functions as an activator of D-amino acid oxidase and are 
activator) involved in the breakdown of gliotransmitter D-serine 
PRODH (Proline dehydrogenase) | NC 000022.11 Enzyme converting proline into D-1-pyrroline-5-carboxy- 

late 

DISCI (Distrupted in schizophrenia NC_o00001.11 Gene responsible for mental illness due to its association 
1) with dopamine impairments. 
RGS4 (Regulator of G protein sig- | NC_ooooo1.u Protein which has a role in modulating signalling through 
nalling 4) G-protein pathways 


Table 2: Antipsychotics drugs used against schizophrenia 


S.No. Generic Name Modeof administration Recommended FDA status Indications PubChem Id 
dose 





First-generation antipsychotics 



































1. Haloperidol Oral, Intramuscular 4-12 mg/d Approved in Schizophrenia CID 3559 
1986 
2. Fluphenazine Oral, Intramuscular 2.5-10 mg/d Approved in Schizophrenia and CID 3372 
1960 Bipolar disorder 
Second-generation antipsychotics 
3. Aripiprazole Oral, Injection 10-15 mg/d Approvedin O Schizophrenia, CID 60795 
2002 bipolar disorder 
4. Clozapine Oral 300-450 mg/d Approvedin ‘Treatment-resistant CID 135398737 
1989 schizophrenia 
5. Iloperidone Oral 12-24 mg/d Approvedin Acute schizophre- CID 71360 
2009 nia 
6. Lurasidone Oral 40-80 mg/d Approved in Schizophrenia CID 213046 
2010 
pA Risperidone Oral; intramuscular 4-8 mg/d Approvedin Schizophrenia CID 5073 
1993 
800 MCOMT 
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0 T T | T T E T T T T 1 mw DRDR3 
WA WP uh u^ ue O Gc oe 
wo QU OD G 
SPM un e as P CEO 
e wv 
Genes 











Figure 1: Graph showing all nine genes with their published literature. 
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Table 3: SIFT result of nsSNPs of the studied genes 





dbSNP ID Nucleotide Change Amino acid Change Tolerance 
index 
COMT 
TS13306281 G/A V92M 0.002 
1976452330 G/A D94N 0.005 
19139449932 C/T R234C o 
TS144463570 C/T R2uW 0.002 
rs6267 T/C A728 o 
TSL45561434 C/G h73M 0.004 
19149909767 G/A G7oR 0.002 
1S199710929 C/T Ri25C 0.002 
1s4986871 A/G A146V o 
TS200150695 G/A Ri84H 0.002 
19201922528 A/T ho4F 0.003 
18373611092 A/G MgoV o 
TS376273380 C/A A168D 0.001 
DAOA 
192391191 G/A R3oK o 
19367543078 C/G N42K o 
19367543079 G/T V85F o 
19367543080 C/T P20S o 
18367543081 A/G K74R o 
19367543081 A/G Ki45R o 
1872549492 C/A A47D o 
1872549492 C/A An8D o 
1872549492 C/G A47G o 
1S9558562 C/G K62E o 
1872549493 C/G Q65E o 
r9138223180 G/A D5oN o 
19138223180 G/A DiziN o 
1s187721661 G/C R648 o 
1$200207534 T/C CuR o 
1$200207534 T/C C82R o 
TS200951630 G/A G68S o 
19371012913 C/T P128 o 
1$371012913 C/T P83S o 
1$371558248 A/G D9G o 
1S373343564 C/T RsıC o 
1S373343564 C/T Rı22C o 
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Table 3: (Continued) 


dbSNP ID Nucleotide Change Amino acid Change Tolerance 
index 
DISC1 
186675281 C/T L607F 0.001 
18367543092 C/T T5731 0.001 
19367543093 A/G K577E 0.001 
rs28930675 C/T T453M 0.004 
rs34622148 C/T L330F 0.001 
rs76175896 C/T A83V 0.003 
TS76230451 A/G T561A 0.003 
rs78640112 G/T V35oL 0.001 
1S34622148 C/T L330F o 
Ts138886515 G/A E470K o 
TS138886515 G/A Ei20K o 
TSI39091980 A/G E161G 0.003 
TSI39420445 C/T $216L o 
1855795950 G/T T328N o 
TS143165003 C/A P540Q o 
19146439119 G/A R223H o 
19147158825 C/T P539L o 
rs148111679 C/T R569W 0.001 
TS821616 G/A $704C 0.004 
TS192018321 C/G P586A 0.002 
TS199530992 C/A S237Y 0.001 
TS199893176 C/G H256D 0.004 
TS200669845 C/G A530G 0.004 
19201556843 A/C E236A o 
TS3738400 C/T G5V 0.001 
18367627719 G/A G55R 0.001 
1S370202687 C/T T6151 0.003 
19377426796 G/C A481P o 
DRD3 
1976256558 C/G W85C 0.005 
75141573183 G/A R148W 0.001 
19143935709 C/T E57K 0.003 
19144644130 G/A Tışl 0.005 
19148428613 T/C N342D o 
19148428613 T/C N375D o 
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Table 3: (Continued) 


dbSNP ID Nucleotide Change Amino acid Change Tolerance 
index 
19149736958 G/A R254C o 
Ts181422088 C/T Vı571 o 
TSs199862630 G/T P135H o 
TS200010990 G/A RiggC o 
TS200269629 C/G Su7T o 
18200875766 G/A R220W 0.004 
19200897022 A/G h24T o 
TS201102020 C/T R58Q 0.001 
1S201504870 G/T P178T 0.005 
1S201708355 T/A I364F 0.005 
TS201882973 A/G Ms2T 0.002 
TS201888918 C/G V334L 0.002 
TS202230210 G/A T155M 0.005 
18368221644. C/T RıggH 0.004 
DTNBPı 
TS370158071 A/G V207A 0.004 
1877460377 C/G R548 0.002 
TS367543103 G/C P317R 0.002 
1367543103 G/C P318R 0.002 
TS367543103 G/C PısıR 0.002 
1916876589 C/T G214D 0.002 
1s16876589 C/T G215D 0.003 
TS142075419 A/G 5230P o 
18142075419 A/G $231P o 
TS142075419 A/G S74P o 
TS144019618 A/G F255L o 
TS144019618 A/G F256L o 
TS144019618 A/G Fo9L o 
TS147011671 A/G 1218T 0.003 
TS147011671 A/G Iz19T 0.003 
18200731587 T/A D174V 0.002 
18200731587 T/A D175V 0.002 
TS200731587 T/A Di8V 0.002 
TS201020144 C/A D329Y 0.001 
TS201020144 C/A D330Y 0.001 
1S201020144 C/A D173Y 0.002 


119 Int J Cur Res Rev | Vol 12 e Issue 20 « October 2020 


Tufchi et al.: In-silico Analysis of Deleterious Single Nucleotide Polymorphisms (SNPs) and Molecular Docking... 


Table 3: (Continued) 


dbSNP ID Nucleotide Change Amino acid Change Tolerance 
index 
19370162147 C/A G73V 0.004 
1S372560190 A/G L238 o 
1S373060790 C/T V76M 0.003 
1S373182049 T/C Qo8R o 
18376313138 C/T V79M 0.001 
18376313138 C/T V235M 0.002 
18376313138 C/T V236M 0.002 
1S377223155 G/C $321C 0.004 
1S377223155 G/C $322C 0.004 
19377223155 G/C $165C 0.005 
GRM3 
TS17856664 C/G P512A 0.002 
TS17856664 C/G P384A 0.002 
1817856664. C/G PıogA 0.005 
TS141671463 C/T T758M o 
TS141671463 C/T T630M o 
TS141671463 C/T T350M o 
rs199660204 T/C F48S o 
TS199660204 T/C F456S 0.001 
TS199660204 T/C F328S 0.001 
19200125543 C/T R66C o 
TS200125543 C/T R68C o 
TS200125543 C/T R68C 0.002 
TS200125543 C/T R68C 0.004 
TS201158915 C/G A73G o 
TS201158915 C/G A75G o 
TS201158915 C/G A75G 0.005 
18267601607 G/A E767K o 
r9267601607 G/A E639K o 
TS267601607 G/A E359K o 
1S370197727 C/T R668C o 
1S370197727 C/T R540C o 
1S370197727 C/T R260C o 
1S372311811 C/T R204C 0.001 
TS372311811 C/T R206C 0.001 
1S372311811 C/T R78C 0.003 
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Table 3: (Continued) 


dbSNP ID Nucleotide Change Amino acid Change Tolerance 
index 
TS373913639 G/A G464R 0.001 
1S373913639 G/A G336R 0.001 
1S373913639 G/A G56R 0.002 
19374144916 G/T C549F 0.001 
18374144916 G/T C421F 0.001 
18374144916 G/T CıgıF 0.001 
1$374569530 C/T T778M 0.001 
1$374569530 C/T T650M 0.001 
1$374569530 C/T T370M 0.001 
1S375977388 T/C I691T 0.001 
19375977388 T/C 1563T 0.001 
1S375977388 T/C 1283T 0.001 
TS377189890 G/T Ru4S 0.004 
NRG 
18367543162 G/T Kn6N 0.005 
195367543163 C/G R250G 0.003 
19367543163 C/G R229G 0.003 
18367543163 C/G R279G 0.003 
195367543163 C/G R352G 0.003 
19367543163 C/G R287G 0.004 
18367543163 C/G Rı22G 0.004 
TS367543163 C/G R276G 0.004 
153924999 C/T R38Q o 
19367543168 C/T P463L o 
18367543168 C/T P586L o 
19367543168 C/T P356L o 
TS367543168 C/T P510L 0.002 
1$367543168 C/T P521L 0.002 
TS367543168 C/T P518L 0.002 
TS367543168 C/T P513L 0.002 
1873672607 C/A P574H 0.002 
1873672607 C/A P608H 0.003 
1873672607 C/A P553H 0.003 
1873672607 C/A P676H 0.003 
1873672607 C/A P6uH 0.003 
1S73672607 C/A P603H 0.003 
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Table 3: (Continued) 


dbSNP ID Nucleotide Change Amino acid Change Tolerance 
index 
1876599953 C/T H306Y o 
1976599953 C/T H371Y 0.004 
1876599953 C/T HigY 0.004 
1976599953 C/T H248Y 0.004 
TS76599953 C/T H298Y 0.004 
1S76599953 C/T H295Y 0.005 
1876599953 C/T H295Y 0.005 
1976599953 C/T H303Y 0.005 
r976810404 C/A S136Y 0.005 
rs80127039 C/T R545W 0.001 
rs80127039 C/T R495W 0.001 
TS80127039 C/T R618 W 0.001 
1880127039 C/T R542W 0.001 
TS80127039 C/T R388W 0.001 
TS80255389 G/A V3gıl 0.002 
1880255389 G/A V320l 0.003 
TS114135581 C/A N357K o 
TS114135581 C/A N365K o 
TS114185597 C/T R619W 0.001 
TS114185597 C/T R389W 0.001 
TS114185597 C/T R543W 0.001 
TS114185597 C/T R554W 0.001 
TS114185597 C/T R551W 0.002 
18114185597 C/T R546W 0.002 
TS115604365 T/G H24Q 0.002 
TS115604365 T/G H233Q 0.002 
19115604365 T/G Hi44Q o 
19115604365 T/G Hi23Q o 
TS115604365 T/G H246Q o 
TS115604365 T/G H178Q 0.003 
19116183863 C/A $523R 0.001 
19139436076 A/C E422A 0.005 
19139436076 A/G E422G 0.003 
TS141355195 G/A R619Q 0.005 
TS146885321 C/T R98C 0.002 
TSL47189312 C/T R6uC 0.003 
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Table 3: (Continued) 


dbSNP ID Nucleotide Change Amino acid Change Tolerance 
index 
TS148350929 G/A A1o2T 0.004 
TS376791440 C/T T460M 0.002 
TS376169851 G/A $407N 0.004 
PRODH 
1S142346005 T/C Y443C o 
TS147233639 G/A P388L 0.001 
TS112389430 G/A R396C 0.004 
TS450046 G/A Q521R 0.004 
1$138400750 C/A C218F 0.001 
TS140831950 C/T R443Q 0.003 
TS140831950 C/T R335Q 0.003 
TS143011525 C/T R324H 0.002 
TSI43011525 C/T R432H 0.003 
TS146889635 C/T V446M 0.001 
TS146889635 C/T V554M 0.001 
18184218784 G/A R577W 0.004 
TSI99714362 C/T D5oN 0.005 
19201627713 C/T A337T 0.002 
TS201627713 C/T A445T 0.003 
18367841908 G/C Hi32D 0.005 
19368452830 A/G L420P 0.003 
19368452830 A/G L528P 0.003 
19369277468 T/C N391S o 
18369277468 T/C N499S o 
TS370792497 G/A R323C 0.001 
19370792497 G/A R431C 0.001 
TS2870984 G/A T466M o 
1S370393004 G/A R563C o 
TS372030860 A/T F5Y 0.002 
1372030860 A/T Fu3Y 0.002 
18372187772 G/C O418E o 
1S372423306 C/T R399Q 0.003 
193970559 G/A R453C 0.005 
19377373292 G/A R579W 0.005 
RGS4 
18368678746 A/T Ru6W o 
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Table 3: (Continued) 


dbSNP ID Nucleotide Change Amino acid Change Tolerance 
index 
19368678746 A/T Ri34W o 
19368678746 A/T R231W o 
1S372256208 G/A C280Y 0.005 


Table 4: List of SNPs for nine genes by using PolyPhen 


SNP Id Position aal aa2 Prediction 

COMT 
rs4680 158 V M benign 
rs6267 72 A S possibly damaging 
SNP Id Position aal aa2 Prediction 
186270 34 C S benign 
184986871 146 A V probably damaging 
1$5031015 102 A T benign 
1945593642 14 R L benign 
1945593642 14 R L benign 
1$5031015 102 A T benign 

DAOA 
1S2391191 30 R K possibly damaging 
189558562 62 K E probably damaging 
18367543078 42 N K benign 
18367543079 85 V F benign 
19367543080 20 P S benign 

DISC 1 
1S3738401 264 R Q benign 
186675281 607 L F probably damaging 
rs821616 704 S [à probably damaging 
1S3738400 5 G V possibly damaging 
1534622148 330 L F probably damaging 
1855795950 328 T N possibly damaging 
TS56020408 116 A V benign 
1$56229136 16 A V benign 

DRD3 
rs6280 9 S G benign 
rs181422088 157 V I possibly damaging 

DTNBPı 

1917470454 272 P S benign 
1816876589 214 G D probably damaging 
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Table 4: (Continued) 


GRM3 

TS17161026 475 G D benign 

NRG: 

193924999 38 R Q possibly damaging 
191050392 289 M T benign 

PRODH 

19450046 521 Q R possibly damaging 
1s1807467 455 A S benign 

1$2238731 427 V M possibly damaging 
1S2870984 466 T M possibly damaging 
TS2904551 441 L P probably damaging 
1$2904552 431 R H probably damaging 
193970559 453 R C probably damaging 
184819756 185 R Ww benign 

rs2008720 19 Q P benign 

rs2870983 472 A T benign 

193970555 406 P L probably damaging 

RGS4 

1914665 195 A S Benign 
18368678746 134 R W probably damaging 


Table 5: Root Mean Square Deviation (RMSD) of native proteins with their respective mutants 


RMSD Energy after energy minimization 


Native COMT protein (4PYI) 2.593 A9 -9350.167 kJ/mol 
A146V with COMT protein 2.682 A“ -9302.922 kJ/mol 
A72S with COMT protein 2.689 A? -9364.405 kJ/mol 
Native DAOA protein 2.42 A9 -1739.696 kJ/mol 
R30K with DAOA 2.43 AP -1800.172 kJ/mol 

K62E with DAOA 1.83 A9 -1562.715 kJ/mol 

Native DISC: protein 2.30 A9 -3368.829 kJ/mol 
L6o7F with DISC1 2.67 AT -3466.654 kJ/mol 
$704C with DISC1 2.49 A9 -3392.598 kJ/mol 
G5V with DISCi 1.75 AC -1739.696 kJ/mol 
L330F with DISC1 1.98 A9 -3295.540 kJ/mol 
T328N with DISC1 2.52 A“ -3420.069 kJ/mol 
Native DRD3 protein 3.65 A? -18798.566 kJ/mol 
Vi57l with DRD3 3.89 A? -19337.301 kJ/mol 
Native DTNBP1 protein 2.54 A? -4446.120 kJ/mol 
G214D with DTNBPı 2.78 AC -4570.659 kJ/mol 
Native NRGi protein 2:3 AU -4865.479 kJ/mol 
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Table 5: (Continued) 


R38Q with NRG1 
Native PRODH protein 
Q521R with PRODH 
T466M with PRODH 
R453C with PRODH 
Native RGS4 protein 
R134W with RGS4 


* Root Mean Square Deviation 


RMSD Energy after energy minimization 
2.9 A9 -5471.062 kJ/mol 

1.52 A9 -2856.177 kJ/mol 

1.96 A? -27330.975 kJ/mol 

1.63 A? -26642.139 kJ/mol 

1.60 A? -24870.443 kJ/mol 

1.68 A9 -6045.847 kJ/mol 

1.75 AP -6381.265 kJ/mol 


Table 6: Stabilizing residues in native and mutant protein structures 


Description 


Stabilizing residues in Native COMT 
protein 


Stabilizing residues in A146V mutant 
COMT protein 


Stabilizing residues in A725 mutant 
COMT protein 


Stabilizing residues in Native DAOA 
protein 


Stabilizing residues in R30K mutant 
DAOA protein 


Stabilizing residues in K62E mutant 
DAOA protein 


Stabilizing residues in Native DISCi 
protein 


Stabilizing residues in L607F mutant 
DISCI protein 


Stabilizing residues in $704C mutant 
DISCI protein 


Stabilizing residues in G5V mutant 
DISCI protein 


Stabilizing residues in L330F mutant 
DISCI protein 


Stabilizing residues in T328N mutant 
DISCI protein 


Stabilizing residues in Native DRD3 
protein 


Stabilizing residues in V157I mutant 
DRD3 protein 


Stabilizing residues in Native DTNBP1 
protein 


Stabilizing residues in G214D mutant 
DTNBP1 protein 


Stabilizing residues in Native NRG1 protein 


Stabilizing residues in R38Q mutant 
NRGI protein 


Stabilizing residues in Native PRODH 


Stabilizing residues 


Leun2, Leung, Glun4, Thri38, Glu14o, Vali88 
Leun2, Leung, Glun4, Thri38, Glu14o, Vali88 
Leun2, Leung, Glun4, Thri38, Glu14o, Vali88 
Vals, Vala, Vals, Gly7, Lys33, Val34, Ala36, Gly131, Ser136, 


Leu139, Ile178, Valı79, Met203, Phe213, Pro284, Ile306 


Vals, Vala, Vals, Gly7, Lys33, Val34, Ala36, Gly131, Ser136, 
Leu139, Ile178, Valı79, Phe213, Ile306 


Vals, Vala, Vals, Gly7, Lys33, Val34, Ala36, Gly131, Ser136, 
Met203, Phe215, Pro284, Ile306 
Leu1006, Ser1008, Valio45, Alano6 
No stabilizing residue was found 
No stabilizing residue was found 
No stabilizing residue was found 
No stabilizing residue was found 
No stabilizing residue was found 
Asn47 

No stabilizing residue was found 
Val53, Gly89 


Val53, Gly89, Vali56, Vali57, Ala159 


Val58, Tyr6ı, Ile79, Ile8o, Gly375, Phe376 


No stabilizing residue was found 


Asp61, Serg7, Vali30, Argi31, Gly159, Ser164, Argı84, Val222, 


Int J Cur Res Rev | Vol 12 + Issue 20 - October 2020 


Tufchi et al.: In-silico Analysis of Deleterious Single Nucleotide Polymorphisms (SNPs) and Molecular Docking... 


Protein Ala274, Tyr275, Val276, Pro277, Tyr278 

Stabilizing residues in Q521R mutant Asp61, Ileg6, Serg7, Vali3o, Gly159, Ile160, Ser164, Leu183, 
PRODH protein Argi84, Val222, Ala274, Tyr275 

Stabilizing residues in T466M mutant Asp61, Ileg6, Argi31, Gly159, Ser164, Ala274, Tyr275 
PRODH protein 

Stabilizing residues in R453C mutant Asp61, Serg7, Vali3o, Argi31, Gly159, Arg184, Val222, Ala274, 
PRODH protein Tyr275 

Stabilizing residues in Native RGS4 Met107 

protein 

Stabilizing residues in Ri34W mutant Met107 

RGS4 protein 


Table 7: Binding energies of studied proteins and their mutant structures against antipsychotic drugs 


Protein Ligand Binding Affinity (kcal/mol) 
DTNBPı Aripiprazole -4.9 
Clozapine -5.7 
Fluphenazine -5.5 
Haloperidol -5.1 
Iloperidone -5.9 
Risperidone -6 
Lurasidone -6.1 
DTNBPı G214D Aripiprazole -5.3 
Clozapine -6.8 
Fluphenazine 75.9 
Haloperidol 75.9 
Iloperidone -5.4 
Risperidone -7 
Lurasidone -6.2 
DRD3 Aripiprazole -7.9 
Clozapine -8.1 
Fluphenazine -7.5 
Haloperidol -9 
Iloperidone -8.4 
Risperidone -9.4 
Lurasidone -9.2 
DRD3_V1571 Aripiprazole -74 
Clozapine -8.1 


127 Int J Cur Res Rev | Vol 12 « Issue 20 - October 2020 


Tufchi et al.: In-silico Analysis of Deleterious Single Nucleotide Polymorphisms (SNPs) and Molecular Docking... 


Fluphenazine -7.5 
Haloperidol -8.4 
Iloperidone -8 
Risperidone -10.4 
Lurasidone -8.6 
COMT Aripiprazole -9.5 
Clozapine -7.8 
Fluphenazine -8.4 
Haloperidol -8.8 
Iloperidone -8.4 
Risperidone -9.4 
Lurasidone -8.9 
COMT_A72S Aripiprazole -81 
Clozapine -8.8 
Fluphenazine -7 
Haloperidol -9.8 
Iloperidone -9.6 
Risperidone -10.9 
Lurasidone -9.9 
Aripiprazole -9.5 
Clozapine -7.8 
Fluphenazine -8.4 
Haloperidol -8.8 
Iloperidone -8.4 
COMT_A146V Risperidone YA 
Lurasidone -8.9 
Aripiprazole -8.5 
Clozapine dia 
Fluphenazine -7.9 
Haloperidol -5.2 
Iloperidone -6.6 
PAOA Risperidone -8.9 
Lurasidone 73 
Aripiprazole -9.3 
Clozapine -84 
Fluphenazine -6.8 
DAOA_R30K Haloperidol E 
Iloperidone -7.6 
Risperidone -9.7 
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DAOA_K62E 


DISC1 


DISC1 L607F 


DISCi_ S7o4C 


DISCı, G5V 


DISCı L330F 
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Lurasidone 
Aripiprazole 
Clozapine 
Fluphenazine 
Haloperidol 
Iloperidone 
Risperidone 
Lurasidone 
Aripiprazole 
Clozapine 
Fluphenazine 
Haloperidol 
Iloperidone 
Risperidone 
Lurasidone 
Aripiprazole 
Clozapine 
Fluphenazine 
Haloperidol 
Iloperidone 
Risperidone 
Lurasidone 
Aripiprazole 
Clozapine 
Fluphenazine 
Haloperidol 
Iloperidone 
Risperidone 
Lurasidone 
Aripiprazole 
Clozapine 
Fluphenazine 
Haloperidol 


Iloperidone 


Risperidone 
Lurasidone 
Aripiprazole 
Clozapine 


Fluphenazine 


-6.7 
-8.6 
74 
-5.0 
-6.9 
-5.2 
-8.7 
73 
7555 
-5.8 
-5.8 
-6.6 
-5.2 
-6.3 
-6.1 
-4.8 
753 
753 
-6.1 
75.9 
-6.3 
-6.4 
-5.6 
-5-7 
-5-4 
-5.6 
-5.8 
-6.3 
-6.2 
-5.6 
753 
75-7 


-6.1 
-5-4 
-5.8 


755 
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Haloperidol -5.9 
Iloperidone -6.1 
Risperidone -6.2 
Lurasidone -6.4 
DISCı T328N Aripiprazole -5.9 
Clozapine 53 
Fluphenazine -5.4 
Haloperidol -6.5 
Iloperidone -6 
Risperidone -6.2 
Lurasidone -6.2 
NRGi Aripiprazole -6.3 
Clozapine -6.5 
Fluphenazine -6 
Haloperidol 77 
Iloperidone -6.4 
Risperidone -7-2 
Lurasidone -6.7 
Aripiprazole -6.5 
Clozapine -5-9 
Fluphenazine -5.6 
Haloperidol -6.5 
Iloperidone -6.5 
NRGi. R38Q Risperidone -6.9 
Lurasidone -7 
Aripiprazole -71 
Clozapine -6.8 
Fluphenazine “7.1 
Haloperidol -73 
Iloperidone -5.8 
PRODH Risperidone -7.5 
Lurasidone -8 
Aripiprazole -6.4 
Clozapine -7.6 
Fluphenazine 7 
Haloperidol -7.6 
Iloperidone -7.4 
PRODH Q521R 
Risperidone -84 
Lurasidone -8.2 
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PRODH T466M Aripiprazole -71 
Clozapine -7.3 
Fluphenazine -7.3 
Haloperidol -8.3 
Iloperidone -8.5 
Risperidone -9.3 
Lurasidone -9.2 
Aripiprazole -5.8 
Clozapine -8.3 
Fluphenazine -7.3 
Haloperidol -71 
Iloperidone -6.8 
PRODH_ R453C Risperidone -10.1 
Lurasidone 7.5 
Aripiprazole -6.5 
Clozapine -6.3 
Fluphenazine -6.4 
Haloperidol -6.3 
RGS4 Iloperidone -6.7 
Risperidone -73 
Lurasidone -7.4 
Aripiprazole 3-7 
Clozapine -2.2 
Fluphenazine -4.5 
RGS4_R134W Haloperidol -2.2 
Iloperidone -5.2 
Risperidone -1.7 
Lurasidone -3.7 
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Figure 2: 3D structure of (a) COMT protein (b) Mutated COMT 
protein A72S (c) Mutated COMT protein A146V (d) DAOA pro- 
tein (e) Mutated DAOA protein R30K (f) Mutated DAOA pro- 
tein K62E (g) DISC1 protein (h) Mutated DISC1 protein G5V 
(i) Mutated DISC1 protein L607F (j) Mutated DISC1 protein 
S704C (k) Mutated DISC1 protein L330F (I) Mutated DISC1 
protein T328N (m) DRD3 protein (n) Mutated DRD3 pro- 
tein V1571 (o) DTNBP1 protein (p) Mutated DTNBP1 protein 
G214D (q) NRG1 protein (r) Mutated NRG1 protein R38Q (s) 
PRODH protein (t) Mutated PRODH protein R38Q (u) Mutated 
PRODH protein T466M (v) Mutated PRODH protein Q521R 
(w) RGS4 protein (x) Mutated RGS4 proteinR134W 
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Number of residues in favoured region (^98.0X expected) : 72 ( 93.5%) 
Number of residues in allowed region ( ~2.0% expected) : $( 6.5%) 
Number of residues in outlier region t o. 0.) 


Figure 3: Ramachandran plot of proteins (a) DAOA (b) DISC1 
(c) DTNBP1 (d) NRG1 (e) PRODH 
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Figure 4: Docking images of (a) DRD3 with Risperidone (b) 
DRD3 V1571 with Risperidone (c) DTNBP1 with Lurasidone 
(d) DTNBP1 G214D with Lurasidone (e) COMT with Risperi- 
done (f) COMT A1468 with Risperidone (g) COMT A728 with 
Risperidone (h) DAOA with Risperidone (i) DAOA K62E with 
Risperidone (j) DAOA R30K with Risperidone (k) DISC1 with 
Lurasidone (1) DISC1 L607F with Lurasidone (m) DISC1 G5V 
with Lurasidone (n) DISC1 I330F with Lurasidone (o) DISC1_ 
S704C with Lurasidone (p) DISC1 T328N with Lurasidone 
(q) NRG1 with Lurasidone (r) NRG1 R38Q with Lurasidone 
(s) PRODH with Resperidone (t) PRODH Q521R with Ris- 
peridone (u) PRODH R453C with Risperidone (v) PRODH . 
T466M with Risperidone (w) RGS4 with Lurasidone (x) RGS4_ 
R134W with Lurasidone. 
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